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Selection System 

This is a continuation of International Application GB99/01526, with an international 
filing date of May 13, 1999, now abandoned. The invention relates to a selection system 
which permits the selection of polypeptides displayed in a phage display system. 

Viruses have been used for the display of peptides and proteins [21, 26, 44]. In 
particular filamentous bacteriophage have been used for display of proteins and peptides 
by fusion of the genes encoding the proteins or peptides to [he gene encoding a phage 
coat protein. As the fusion gene is encapsidated in the phage that is displaying the 
fusion protein, this provides a linkage of phenotype and genotype. Repertoires of 
proteins can be encoded by a population of phage, and the rare phage comprising 
proteins with predefined binding activities isolated by binding to solid phase. In this 
way synthetic human antibodies of predefined antigen-binding specificity have been 
selected from repertoires of antibody fragments assembled from different structural 
elements [10]. As the antibody needs to be folded to bind antigen, selection for binding 
also selects for folding. This principle has also been used for selection of folded 
peptides where binding is mediated by a discontinuous epitope [8, 11-13]. 

A problem present in phage display systems is the presence of high levels of 
background caused by the presence of phage not displaying desired polypeptides. For 
example, antibody repertoires are commonly encoded as fusion proteins with the p3 
protein on phagemid vectors and are encapsidated by the use of helper phage. The 
helper phage coat protein competes with the fusion of antibody and coat protein 
(encoded on the phagemid), leading to phage with "monovalent" rather than multivalent 
display of folded antibody fragments. This can be useful in discriminating between the 
affinity and the avidity (with multivalent display) of the antibodies displayed on the 
phage. However the great majority of phages only display the helper phage coat protein 
which contributes to a "background" binding to antigen/* In this case it is desirable to 
select for phages that display folded antibodies, and to eliminate those that do not. 

Moreover, all of the systems in current use rely on a binding activity in the polypeptide 
to be selected in order to perform the isolation of the desired display bodies from those 
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which do not encode polypeptides having a desired characteristic. This places a 
limitation on available display systems to the selection of folded polypeptides which 
possess a known binding activity. It would be desirable to have a means for selection of 
displayed proteins or polypeptides that is independent of the binding activity thereof. 

5 

For example, there is considerable interest in building folded proteins de novo. 
Attempts have been made to design proteins de novo by assembly of predefined 
elements of secondary structure and also from random sequences (for review [5]). In 
some cases the designed proteins have been shown to retain elements of secondary 

10 structure but lack the stable tertiary interactions characteristic of the folding of native 
proteins, suggesting the presence of molten globules (see [6] and references therein). 
More successful has been the creation of native-like proteins based on a pre-existing 
backbone [7, 8]. In these cases the binding activities of a de novo designed protein will 
be unknown. In this case it is desirable to select for phages displaying folded proteins, 

15 and to eliminate those that do not. 

Although anempts have been made to screen for folded proteins by their ability to 
survive degrading enzymes in bacteria [16-18], such methods do not allow for selection 
if bacterial growth or survival does not depend on the function of the folded protein. 
20 Thus, these systems are only applicable to a small minority of polypeptides which one 
might wish to select according to the ability to fold. 

It has previously been shown that the insertion of a peptide sequence between a 
proteolytically stable tag fused to the minor phage coat protein p3 and the p3 protein 

25 itself, followed by proteolysis, provides a means to select for phages bearing peptide 
sequences that are susceptible to proteolysis [19, US Patent 5,780,279]. In these 
experiments, phage are bound to an affinity resin binding an N-terminal, proteolytically 
stable tag on the phage. If the bound phage are subjected to proteolysis and elution, 
only phage with cleavable sequences are eluted. This method is used to identify, among 

^0 a repertoire displayed on phage, amino acid sequences suitable as substrates for 
proteases. The sequences introduced are short and would not be capable of folding 
independently. Moreover, the system selects specifically for eluted rather than bound 
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phage; in other words, it is specifically configured to isolate cleaved rather than 
uncleaved phage. 

Summary of the Invention 

5 

The present invention exploits the application of peptide cleavage to eliminate unwanted 
viruses. 

According to a first aspect, therefore, the invention provides a method for the selection 

10 of a virus comprising the steps of: 

(a) providing a virus encoding and displaying a fusion polypeptide, said fusion 
polypeptide comprising a heterologous polypeptide inserted into the sequence of a viral 
coat protein polypeptide, wherein said virus comprises a cleavable site located within a 
displayed polypeptide; 

15 (b) exposing the virus to a cleaving agent; 

(c) propagating the virus comprising intact fusion protein. 

According to the present invention, virus may be selected by cleavage of non-resistant 
virions using a cleaving agent. As used herein, i% virus" refers to an infective 

20 inocculum of virions, which may incorporate cleavage sites, optionally as part of 
heterologous polypeptides encoded by the viral genome. Thus, "virus" may refer to a 
plurality of virions, such that it may encode a repertoire of polypeptides; alternatively, 
as the context requires, it may be used to denote a single virion. The term "virus" 
includes any suitable virus which may incorporate a cleavage site, either naturally or 

25 through manipulation. A preferred virus for use in the present invention is 
bacteriophage, preferably filamentous bacteriophage. 

The term "polypeptide" is used generally to denote molecules constructed of a plurality 
of amino acids, the amino acids being joined together covalently such as through 
30 peptide bonds. "Fusion" polypeptides are essentially polypeptides which are 
incorporated into viral coat proteins, such that a fusion is created between the viral coat 
protein and the polypeptide in question. The fusion may incorporate the polypeptide 
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into the viral coat protein, advantageously between domains thereof, or place it at one 
end thereof, to make a terminal fusion. The polypeptide is referred to as a 
"heterologous" polypeptide, to denote that it is heterologous to the viral coat protein 
into which it is inserted. It is possible, however, that it is derived from another 
5 polypeptide of said virus. 

In one sense, polypeptide is used interchangeably with "protein" herein, in that no 
difference of structure or size is implied. Substantially any polypeptide may be selected 
for by the method of the present invention, including structural polypeptides, 

10 polypeptides having enzymatic activity and polypeptides having binding activity, 
including antibodies and antibody fragments. Cleavage sites may be present in the 
polypeptides, and may be naturally-occurring or may be engineered into the polypeptide 
or into a linker peptide attached thereto. "Polypeptide v may also refer to inserted 
polypeptides which are essentially non-folding polypeptides and serve to encode a 

15 cleavable site and insert this site into the coat protein of a virus. Inserted polypeptides 
may take the form of N- or C-terminal fusions, or may form part of the coat protein 
itself. 

A "cleavable site" is a site capable of cleavage when exposed to a cleaving agent. In 
20 the present invention, the use of protease cleavage sites, capable of being cleaved with 
proteases, is preferred. Protease cleavage sites may be part of, or incorporated in, 
polypeptides according to the invention; alternatively, it may be independently 
engineered into the coat protein of the virus. A feature of the cleavable site is that it 
should either be absent from the virus other than at the site of its specific insertion 
25 according to the present invention, or otherwise inaccessible to cleavage, or present 
only in viral proteins which are not required after virion assembly to mediate infection. 

In accordance with the invention, the cleavable site may 'be inserted into or present in 
any suitable position in the virus. Advantageously, however, it is inserted into or 
30 present in either the coat protein itself or the heterologous polypeptide which forms part 
of the fusion polypeptide. 
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In a preferred aspect of the present invention, more than one cleavable site may be 
used. For example, one site may be inserted or otherwise known to be present in the 
virus, whereas the presence of another site may be unknown or dependent on 
randomisation of the heterologous polypeptide sequence. In a particularly preferred 
5 embodiment, the cleavable sites may comprise a protease cleavable site and a bond 
formed through sidechains on one or more amino acids, such as a disulphide bond. 
Disuiphide bonds are cleavable by reducing agents, such as DTT or ji-mercaptoethanol. 

If a virus contains a disulphide bond, cleavage of a protease cleavable site located 
10 between the two cysteine residues which form the bond through their sidechains will not 
lead to loss of viral infectivity since the disulphide bond is capable of retaining the 
covalent linkage of the viral polypeptides. 

Thus, the invention further provides a method for identifying the presence of disulphide 
15 bonds in polypeptides. 

Conversely, in the event that the selection of disulphide-comaining polypeptides is not 
desired, viruses are advantageously treated with a reducing agent before or after 
proteolysis, in order to eliminate background due to viruses which have been cleaved by 
20 the protease but which have been held together by disulphides. 

The fusion polypeptide may comprise one or more heterologous polypeptides. In the 
case of a terminal fusion, one such heterologous polypeptide may function as a protein 
tag, allowing phage which express the fusion polypeptide to be identified. The 
25 cleavable site may, in such a case, be positioned in or near the tag, such that cleavage 
of the cleavable site releases the tag. 

A tag is any suitable entity capable of binding to a iigand which may be used to isolate a 
virus by the method of the present invention. Accordingly, the tag is resistant to the 
30 cleaving agent used in the method of the invention. Examples of tag/ligand pairs 
include barnase/barstar, avidin/biotin, antibody or antibody fragments and ligands, 
chelating groups and chelates, for example metals, and the like. 
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In all embodiments of the present invention, which are described in greater detail 
below, the uncleaved polypeptides are selected for; the cleaved material is discarded in 
the selection step. 

5 Preferably, the virus according to the invention encode a repertoire of heterologous 
polypeptides. A repertoire is a collection of members, preferably polypeptides, which 
differ slightly from each other in a random or partially randomised manner. Preferably, 
a repertoire of polypeptides is a collection of variant polypeptides which preferably 
incorporate random or partially randomised mutations. As used herein, a repertoire 
10 preferably consist of 10 4 members of more. A repertoire advantageously comprises a 
very large number of members, typically between 10 s and 10 11 , and potentially 10 14 or 
higher. 

The heterologous polypeptide, or repertoire of such polypeptides, is advantageously 
15 displayed on the surface of the virus which encodes it, by virtue of its being 
incorporated into a coat protein, or on the surface of cells infected by the virus. Where 
the virus is a bacteriophage, the protein may also be displayed on the surface of bacteria 
infected with the bacteriophage. 

20 The cleavable site is advantageously located in or adjacent to the heterologous 
polypeptide, such that it can be protected by folding of the heterologous polypeptide and 
thus allow selection for heterologous polypeptides which are capable of correct folding. 
Alternatively, however, the cleavable site may be located distal to the heterologous 
polypeptide; in such embodiments, the cleavable site may serve to permit reduction of 

25 background in phage display techniques. For example, introduction of the cleavable 
site into helper phage used with phagemid encoding a repertoire of polypeptides allows 
helper phage to be removed by cleavage prior to infection of host cells, thus 
dramatically reducing background due to "empty" phage. Advantageously, therefore, 
the cleavable site is incorporated into the virus coat protein. 

30 

As referred to herein, a phagemid is a plasmid cloning vector which comprises viral 
replication sequences but is deficient in at least one viral function. This means that 
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whilst phagemid may be inserted into host cells by conventional nucleic acid transfer 
methods, and will exist in the host cells in an episomal state, they are unable to 
assemble into virions and thus complete a viral cycle of infection. Helper phage are 
used to supply the deficient viral functions and permit the phagemid to be packaged into 
5 virions. In accordance with the invention, phagemid may encode coat protein fusions 
with heterologous polypeptides which incorporate a cleavable site. 

Helper phage provide the viral function lacking in phagemid in order to allow 
packaging of the phagemid into virions. According to the invention, helper phage may 
10 be modified in order to render them cleavable by a cleaving agent, for example a 
protease. In one aspect of the present invention, helper phage may incorporate a coat 
protein having a cleavable site which, when cleaved in the "rescued" progeny phage, 
will render the helper phage-derived coat protein unable to mediate infection. 

15 As will be apparent from the forgoing, in the method according to the present invention 
the virus which are resistant to cleavage are selected. Advantageously, the resistant 
virions will be selected by infection of susceptible host cells, such as bacterial host 
cells. Cleaved virions are not infective. Alternatively, binding of virions to a ligand, 
for example via a tag, is dependent on protection of a cleavable site in the virus, such 

20 that viruses which are cleaved are not isolated by ligand/tag binding. 
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Detailed Description of the Invention 

The present invention may be configured in a number of ways according to the intended 
procedure to which it is desired to apply the basic methodology. Selective cleavage of 
5 virions is used to reduce background in phage display techniques. By cleaving and 
removing phage which either do not contain a heterologous polypeptide, or express a 
heterologous polypeptide which is not capable of correct folding, the sensitivity of a 
phage display process can be increased substantially. As demonstrated in the 
experimental section below, using the methodology of the invention phage display may 
10 be used to select polypeptides which are not susceptible to selection by display 
techniques according to prior art methods. 

In a first configuration, cleavage of a cleavable site in a virion coat protein may be 
exploited to reduce background attributable to phage displaying no heterologous 

15 polypeptides, or phage displaying heterologous polypeptides which are incapable of 
folding correctly. Cleavage of displayed polypeptides, in accordance with the present 
invention, . results in the impairment of the viruses to achieve infection of host cells. 
Thus, by propagating viruses which have been exposed to a cleavage agent, it is 
possible to enrich the virus for virions which comprise displayed polypeptides w r hich are 

20 resistant to cleavage. As used herein, "impair" means to reduce; it thus includes but 
partial and complete prevention of infection of host cells by affected virus. 

The coat protein is selected as the site for cleavage on the grounds that it is available to 
cleaving agents in the host cells harbouring the virus or at the surface of the virus itself. 
25 Thus, in a preferred embodiment, virus preparations may be treated with a cleaving 
agent, in order to render virions having cleavable coat proteins unable to mediate 
infection of host cells. Alternatively, cells infected with the virus may be treated with a 
cleaving agent active within the cell, which will prevent packaging of virus comprising 
a cleavable coat protein. 



30 



9 



According to the present invention, reference to selection may be interpreted as a 
reference to screening, since the same processes may be used to screen phage, as will 
be apparent to persons skilled in the art. 

5 Cleavable sites may be naturally part of the coat protein, but preferably they are 
engineered therein. Preferred cleavable sites include protease cleavage sites, which 
may be found in polypeptides or engineered as an integral part of their sequence. 
Typically, protease cleavage sites may be defined in terms of amino acid sequences 
which are susceptible to cleavage by a protease. For example, the invention 

10 encompasses the use of protease cleavage sites cleavable by one or more of the 
proteases trypsin (cleaves at Lys, Arg), chymotrypsin (Phe, Trp, Tyr, Leu), 
thermolysin (small aliphatic residues), subtilisin (small aliphatic residues), Glu-C (Glu). 
Factor Xa (Ile/Leu-Glu-Gly-Arg), Arg-C (Arg) and thrombin. 

15 Protease cleavage sites may be incorporated into the coat protein of a virus by 
constructing a fusion between the coat protein and a further polypeptide, the further 
polypeptide containing the cleavage site. The further polypeptide should be inserted at 
a position in the viral coat protein such that it allows the assembly of a functional viral 
capsid and subsequent infection, but if cleaved will result in the impairment of 

20 infectivity. 

If the protease cleavage site incorporated in the coat protein remains uncleaved, 
therefore, the virus is capable of assembly into functional virions and retains the 
potential to infect host cells. If the protease cleavage site is cleaved, however, the 
25 structure of the viral coat protein will be compromised and the viral will lose at least 
part of its potential to infect host cells. 

In a preferred embodiment, the virus for use in the present invention is a bacteriophage, 
preferably filamentous bacteriophage. Filamentous bacteriophage is widely used in 
30 phage display techniques for the selection of polypeptides from phage libraries encoding 
a large repertoire thereof. Conventionally, the repertoire of polypeptides is inserted in 
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the p3 protein of filamentous bacteriophage, but any other suitable site may be 
employed within the scope of the present invention. 

In the case of the p3 protein of filamentous bacteriophage the protein consists of three 
domains. The N-terminal Dl is involved in binding to the tolA receptor, D2 in binding 
to the F-pilus (and mediating infection) and D3 in anchoring the protein to the phage 
particle. Peptides and proteins can be inserted at the domain boundaries without 
abolishing infectivity [21, 22], but the presence of all the domains is essential for phage 
infectivity [23]. The bacteriophage are resistant to proteolysis (allowing their use as 
''substrate" phage, [19]), but the introduction of polypeptides comprising protease 
cleavage sites into p3, for example at the junctions between domains leads to loss in 
infectivity of the phage upon proteolysis. 

The protease cleavage sites may be incorporated into heterologous polypeptides. As 
described above, heterologous polypeptides may be encoded in the form of a repertoire 
in a phage library. As folded polypeptides or proteins are often resistant to proteolysis 
and unfolded proteins are sensitive, cleavage requires the polypeptide chain to bind and 
adapt to the specific stereochemistry of the protease active site, and therefore to be 
flexible, accessible and capable of local unfolding [14, 15]. The cloning of a 
polypeptide comprising protease cleavage sites at the domain junctions of p3, followed 
by proteolysis, provides a means of selection for phages bearing proteins that are 
resistant to proteolysis and are folded. 

In the case of phage display repertoires (wherein the polypeptide to be selected is cloned 
at the N-terminus of p3) encoded on phagemid vectors, the use of helper phage 
comprising a polypeptide comprising protease cleavage sites at the domain boundaries, 
followed by proteolysis, provides a means of selection for phages that display the fusion 
protein by eliminating the helper phage after the "help" has been given. 

The use of the protease-cleavable helper followed by protease cleavage selects for 
phages bearing the fusion protein (and for good display). As many phages in a 
repertoire do not display fusion proteins [26] and these contribute to non-specific 
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binding of the phage, this should also improve selection efficiencies. When using the 
techniques of the prior art, only between 0.1-1% of all phage particles in a phage 
library may comprise a gene 3 protein arising from the phagemid. Therefore the 
majority (99-99.9 %) of phage particles that have bound non-specificaliy to the solid 
5 support used in selection will comprise p3 from the helper phage (irrespective of the 
genome carried by the phage particle which most likely will be a phagemid DNA), 
these panicles are rendered non-infective by proteolytic cleavage. 

According to a third embodiment, the selection process may be used for identification 
10 of interacting protein elements. If two such elements linked by a polypeptide 
comprising protease cleavage sites are cloned between the D2 and D3 domains for 
display on phage, the only infectious phages after proteolysis are those in which the D2 
and D3 domains are held together by non-covalent interactions between the interacting 
protein elements. The invention accordingly permits selection of a repertoire of 
15 polypeptides for its ability to interact with a selected polypeptide, or a second repertoire 
of polypeptides. Unlike the two-hybrid system, the invention relies on dissociation of 
non-interacting elements as distinct from the association of interacting elements for the 
selection step. Moreover, the invention permits the harnessing of the power of phage 
display to greatly increase the degree of selection. 

20 

The invention optionally comprises the use of conditions or agents, during cleavage of 
the cleavable site, which modulate the lability of the cleavage site in the presence of the 
cleaving agent. This approach may be used to increase cleavage, for example to select 
only for polypeptides which fold in such a manner as to closely shield the cleavable site 
25 from access by the cleaving agent, or to decrease cleavage, for example to select stable 
mutants from a repertoire of polypeptides which is ordinarily relatively labile under 
cleavage conditions. 



For example, modulation of the lability of the cleavable site may be achieved by the use 
30 of agents which increase or decrease such lability. Thus, a protein denaturant may be 
included, at a suitable concentration, to destabilise a polypeptide and render it more 
labile. Alternatively, a ligand for a polypeptide may be included. The ligand may 
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stabilise the folded structure of the polypeptide, rendering it less sensitive to cleavage. 
Alternatively the ligand may destabilise the folded structure of the polypeptide, for 
example by favouring the adoption of an alternative configuration. This may render the 
polypeptide more accessible to the cleavage agent, and thus more labile. 

In a further embodiment, the conditions of the cleavage process may be altered, such as 
by manipulating the pH or the temperature at which cleavage is conducted, to achieve 
similar effects. Thus, deviation of the pH from the optimum for the polypeptide 
comprising the cleavable site may cause the site to become more accessible to the 
cleaving agent. Similarly, raising (or lowering) the temperature of the conditions under 
which the polypeptide is cleaved may render the polypeptide more or less susceptible to 
cleavage. 

In some instances, non-covalent interactions may be responsible for peptides retaining 
their structure and coat proteins remaining viable, even after successful cleavage of the 
cleavable site. The use of denaturants, temperature variation and other potentially 
destabilising techniques may also be used to decrease the likelihood of a cleaved 
polypeptide retaining its structure. 

Proteolytic selection for protein folding may be applied in a number of areas, as it 
allows much larger numbers of proteins to be processed than with conventional 
screening. For example, it allows the isolation of mutant proteins with improved 
stability [1], for example from combinatorial libraries of mutants in which residues at 
several sites are varied simultaneously [39, 40] or from random mutants or by 
recombination [3, 4]. It also allows the isolation of novel proteins and architectures 
from large repertoires of sequences [16-18, 41]; and for improvement in folding 
stability over several rounds of mutation and increasingly stringent selection, much like 
the affinity maturation of antibodies. 

A second configuration of the present invention concerns the use of tags to allow 
isolation of correctly folded heterologous polypeptides, exploiting the ability of 
correctly folded polypeptides to protect a cleavable site on to near to an associated ta°, 
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The insertion of a polypeptide between the stable tag fused to the N-terminus of the 
viral coat protein and the coat protein itself, followed by cleavage, provides a means of 
selection for virus bearing proteins that are resistant to proteolysis and are folded. Thus 
only virions, whose inserted polypeptide is not degraded, will keep the tag fusion as 
5 part of their coat, and only these virions can therefore be captured by affinity 
purification using this tag. After elution the affinity captured phases from the ligand, 
these phages can be propagated and subjected to further rounds of the same selection 
procedure. 

10 Alternatively, virions may be bound to an affinity matrix, comprising a ligand for the 
tag, prior to cleavage. The cleaving agent may subsequently be added, and only 
resistant phage will be retained on the matrix. These may then be eluted as required. 

Suitable matrices include columns, beads and other surfaces to which a ligand for the 
15 tag is bound. 

According to the present invention, reference to selection may be interpreted as a 
reference to screening, since the same processes may be used to screen phage, as will 
be apparent to persons skilled in the art. 

20 

Cleavable sites are substantially as described for the previous configuration of the 
present invention and are advantageously protease cleavable sites. 

Cleavage requires the polypeptide chain to bind and adapt to the specific 
25 stereochemistry of the protease active site, and therefore to be flexible, accessible and 
capable of local unfolding [14, 15]. Folded polypeptides or proteins are often resistant 
to proteolysis, due to a relative inflexibility in their structure, whilst unfolded proteins 
remain sensitive. 
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As referred to above, the possible selection of polypeptides from a repertoire which, 
through variation or mutation, do not contain a recognition sequence for any particular 
protease used in this method, can be circumvented in two ways. For example, the use 
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of a cocktail of proteases with very distinct recognition sequences would ensure that all 
polypeptides should be cleavable, if not protected by their folded status. Alternatively, 
a phage repertoire of polypeptides to be selected could be partially denatured, such that 
the inserted polypeptide unfolds but the phage and the N-terminal tag remains intact. 
5 Proteolytic digestion followed by affinity purification would remove all phages from the 
repertoire, which have escaped proteolysis due to the lack of protease recognition 
sequences in the polypeptide. Phages not bound by the resin, contain only phages, 
which contain the protease recognition sequence in the polypeptide displayed and which 
may or may not escape proteolysis under non-denaturing conditions. Thus these would 
10 be subjected to proteolytic selection based on protection by the folding status of the 
polypeptide displayed. 

The selection process may also be used for the identification of interacting protein 
elements. Thus if two such elements linked by a polypeptide comprising protease 
15 cleavage sites are cloned between the N-terminal, proteolytically stable tag for display 
on phage and the coat protein, the only phages after proteolysis, that can be captured 
via affinity binding to the tag, are those in which the tag and the p3 protein are held 
together by non-covalent interactions between the interacting protein elements, 

20 The invention is further described in the following examples, for the purposes of 
illustration only. 

Example 1. Resistance of filamentous phage to proteolysis, 

25 Materials and Methods for Examples 1 - 6 are appended at the end of Example 6. 

Phage is incubated under a range of denaturing conditions in vitro and then restored to 
native conditions immediately before infection of bacteria. The incubation of phage in 
10 M urea, or extremes of pH (as low as pH 2, and as high as pH 12) and temperature 
30 (as high as 60°C) did not lead to a major loss of infectivity (Table 1). This indicates 
that the phage is either resistant to denaturing conditions or that if it does unfold it is 
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able to refold rapidly. However with GndHCl a 5 fold loss in phage infectivity is 
observed above 5 M and a further 5 fold loss at 8 M (Table 1). 

Phage is then incubated under native conditions with a range of proteases (trypsin, 
5 Factor Xa, IgA protease, Asp-N, chymotrypsin, Arg-C, GIu-C, thrombin, thermolysin, 
subtilisin) with different specificities. There is no loss in infectivity except for subtilisin 
which has been reported to cleave the p3 protein [24]. If phage is incubated under 
denaturing conditions in the presence of proteases such as trypsin in 3.5 M urea (or > 
47 °C), infectivity is lost. This indicates that under denaturing conditions the unfolding 
10 of the phage coat proteins is sufficient to make sites available for proteolysis. 

Example 2. Construction of phage with protease cleavage sites. 

A sequence (PAGLSEGSTIEGRGAHE) comprising several proteolytic sites is inserted 
15 in the flexible gly cine-rich region between the D2 and D3 domains of the phage p3. 
Incubation of the phage (fd-K108) under native conditions with trypsin, thermolysin or 
subtilisin now resulted in almost complete loss of infectivity (from 10 7 to < 10 TU/ml) 
and incubation with Glu-C and chymotrypsin resulted in a major loss (from 10 7 to 10 4 
TU/ml). This indicates that these proteases cleave the new linker. However incubation 
20 with Factor Xa, Arg-C or thrombin did not lead to a loss in infectivity, despite the 
presence of potential cleavage sites for these enzymes. Presumably the presence of the 
D2 and D3 domains may block access or cleavage for these enzymes in the case of the 
present polypeptide. 

25 Example 3. Construction of protease cleavable helper phage and phagemid 

Fusion of proteins to p3 should lead to multivalent display of the protein on the phage. 
However if the protein is fused to p3 encoded by a phagemid (such as pHENl [25]), 
and the bacteria harbouring the phagemid is rescued with a helper phage (such as 
30 VCSM13), the fusion protein has to compete for incorporation into the phage with the 
helper p3. This leads to so-called "monomelic" phage, in which usually less than one 
copy of the fusion protein is attached to each phage particle [26] . 
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The use of "monomeric" phage might be expected to be advantageous for selection of 
high affinity interactions. Furthermore fusion proteins nTmonomeric" phage should be 
more sensitive to proteolysis, as interactions between multimers of fusion protein are 
avoided. However a disadvantage is that the majority of infective phages do not display 
a protein; such phages binding non-specifically to solid phase are amplified during each 
round of phage growth. 

Protease cleavable helper phage are therefore constructed, by introducing the protease 
cleavage sequence between the D2 and D3 domains to generate the helper phage KM 13. 
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KM 13 is shown to rescue the phagemid pHENL Furthermore trypsin is shown to 
cleave a major fraction (about 50%) of p3 of the rescued phage as shown by Western 
blot and detected with an anti-D3 mAb (Fig. 1). However phage infectivity is hardly 
altered by the cleavage; it therefore appears that only a fraction of the p3 need be entire 
to mediate bacterial infection. 



KM13 is also shown to rescue a pHENl phagemid encoding a single chain antibody 
fragment [27]. Here cleavage by trypsin resulted in a 50 fold loss in phage infectivity 
(data not shown), consistent with indications that only a small fraction of the phage 
express fusion protein when rescued with helper phage [26, 28]. 

A protease cleavable phagemid is also constructed. The phagemid can be rescued with 
KM13 or VCSM13. As expected, infectivity of this phagem:id rescued with KM13 (but 
not VCSM13) is destroyed by trypsin. This phagemid vector is prone to deletions in the 
D2-D3 linker; by changing the codon usage in the linker regions on either site of the 
protease cleavable site, and shortening the length of these linker regions, a more stable 
vector is created (pKl; Fig. 2). In a second vector (pK2; Fig. 2), the sequence of the 
polylinker is arranged so as to place D3 out of frame to render religations within the 
poly linker non-infectious. 

Example 4. Construction of a phage antibody library using protease cleavable 
helper phage. 



Bacteria are electroporated with phagemid DNA encoding a repertoire of scFv 
fragments fused to the N-terminus of p3 and grown in liquid culture (2xTY containing 
antibiotic to select for bacteria containing phagemid and glucose to suppress expression 
of gene 3). In the mid log phase of bacterial growth (OD600 = 0.5) the helper phage 
KM13 is added to the bacteria to give a ratio of helper 'phage to bacteria of 20:1. The 
bacteria is incubated at 37 C C without shaking for 45 min then with shaking for 45 min. 
The bacteria are harvested by centrifugation and resuspended in fresh medium 
containing 50 /xg/ml kanamycin and antibiotic, without glucose, to select for presence of 
phagemid DNA. The culture is grown overnight at 30°C with shaking. 



19 



Bacteria are removed from the phage containing supernatant by centrifugation. Phage is 
precipitated from the supernatant by adding 1/5 the volume of 20% PEG/ 2.5 M NaCl. 
After 1-2 hours at 4°C the precipitated phage is collected by centrifugation. The phage 
is resuspended in PBS (a second PEG precipitation is optional) and can be used in 
5 . selection. 

The library of phages is allowed to bind to antigen (immobilised on solid support such 
as an immunorube or in solution to tagged (i.e. biotinylated) antigen which can be 
immobilised after affinity binding of phage antibodies). Unbound phage is removed by 
10 extensive washing (the stringency of washing can be varied with respect to time and 
detergents added). 

Phage libraries comprising a cleavable tag, such as the c-myc tag inserted between the 
antibody and gene 3, can be eluted by addition of trypsin in solution at a concentration 

15 of 0.1 to 1 mg/ml. (Phage libraries without a cleavable sequence between the antibody 
and gene 3 can be eluted by adding 100 mM Triethylamine. In this case the solution is 
neutralised by adding 1 M Tris-HCl pH 7.4. and after 10 min, trypsin added to a final 
concentration of 0.1 to 1 mg/ml.) Trypsin also cleaves the copies of gene 3 from the 
helper phage, while leaving gene 3 from the phagemid intact. Thus only phage that had 

20 carried, or still carry, an antibody fusion will be infective. 

Phage is used to infect bacteria in mid log phase of growth (OD600 = 0.5), and the 
bacteria is plated on agar plates containing antibiotic selecting for phagemid DNA. 
Individual clones were picked and phage prepared as above. The resulting phage is used 
25 in ELISA to identify phage antibodies binding specifically to the antigen of interest. 

Example 5. Selection for folding using barnase as a model. 

Barnase is a small RNase of 110 amino acid residues whose folding has been 
30 extensively studied (for review [2]). Barnase contains multiple sites for trypsin 
cleavage, although the folded protein is resistant to cleavage (data not shown). Phage 
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with barnase cloned between D2 and D3 should therefore be resistant to protease 
cleavage and capable of selection. 

As barnase is toxic to Escherichia coli, a mutant A (Hisl02-> Ala) is cloned which is 
5 catalytically inactive but stable [29, 30] into the phagemid pK2. A mutant B (His 102- 
> Ala,Leul4-> Ala) is also cloned, with lower stability; Leul4 is buried in the 
hydrophobic core and its mutation creates a large cavity in the core affecting the 
packing of different structural elements [31]. The phages (rescued with KM13) bind to 
the inhibitor barstar by ELISA (Fig. 3), and therefore display the mutant barnase in a 
10 folded form. 

The phages are then incubated with trypsin at a range of temperatures (Fig. 4). After 
incubation at 10 °C, there is a decrease in phage infectivity of 5 to 10 fold for both 
mutants, suggesting that (as above with the display of scFv fragment) only a small 
15 fraction of the phages display the fusion protein. There is no further loss in infectivity 
on cleavage until 30°C (for mutant B) or 37°C (for mutant A). In both cases the major 
transition is at least 10°C below that expected for the reversible thermal unfolding of 
the mutants. 

20 Phages A and B are mixed in different ratios and incubated with trypsin at 20 C C, where 
both mutants are stable to cleavage, or at 37 C C where only A is stable. After 
"proteolytic selection" the phages are plated and analysed by PCR, which is followed 
by restriction digest to distinguish the mutants. As shown in the Table 2, mutant A is 
enriched by a factor of 1.6 x 10 4 after a single round and by 1.3 x 10 6 after two rounds 

25 of proteolytic selection at 37° C. No enrichment can be detected at 20°C. 

Example 6. Selection for folding using villin as a model. 

The 35 amino acid subdomain of the headpiece domain of the f-actin-bundling protein 
30 villin [32] is much smaller than barnase. It nevertheless forms a stable fold at room 
temperature and is resistant to proteolysis; furthermore its stability does not rely on 
disulphide bonds or binding iigands [33]. The villin subdomain (which contains several 
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potential trypsin cleavage sites) is cloned between the D2 and D3 domains of the phage, 
and incubated with trypsin at different temperatures. The profile for loss of infectivity is 
not as sharp as with barnase, with the major transition below 35 °C, considerably below 
the thermal unfolding of villin (70°C) [32, 33]. The phage displaying villin are mixed 
5 with phage, which were produced using the phagemid pKl and the helperphage KM13, 
and incubated with trypsin. After a single round of proteolytic selection, the villin 
fusion phage are enriched 8.7 x 10 3 fold (Table 3). 

In summary, the results from Examples 1 and 2 show that the infectivity of the phage is 
10 relatively resistant to temperature, pH, urea and GndHCl, and to several proteases, but 
if a flexible linker comprising a protease cleavage site is inserted between domains D2 
and D3 of the phage coat protein p3, the phage becomes sensitive to cleavage. By 
contrast, as shown in Examples 5 and 6, if the protease cleavage sites comprise a folded 
protein domain such as barnase or villin, the phage is resistant to cleavage. This allows 
15 proteolytic selection for protein folding, with enrichment factors of greater than 10 4 
fold for a single round of selection. Selection is evident for both for barnase, an average 
sized [34] domain of 110 amino acids and for villin, a small domain of 35 amino acids. 

Discrimination between structures of different stabilities can be accomplished by 
10 increasing the stringency of proteolytic selection. Thus with increase in temperature, 
both barnase and villin became susceptible to cleavage, reflecting protein unfolding. 
However the main impact of protease cleavage is at a temperature lower than the 
unfolding transition as measured by circular dichroism [38]. This may reflect the fact 
that the unfolding transition is a fully reversible process, whereas the cleavage by 
^5 proteases (of unfolded structure) is a kinetic and irreversible process, pulling over the 
equilibrium from folded to unfolded (and cleaved) structure. This is consistent with the 
CD unfolding transition seen with villin [33], where at temperatures as low as 35 °C 
there is evidence of unfolding, the same point at which villin starts to become 
susceptible to protease attack. 
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Table 2. Selection of Barnase mutants. 

Mixtures of barnase mutants (A+B) in ratios from 1:1 to 1:10^ were selected by 
proteolysis at 37 °C, 24 (or 36 in round 2) phage clones analysed and numbers of each 
5 mutant noted above. a Selection at 20 °C where both mutants are expected to be stable, 
^ before selection. 

Phage A: Phage B 





1:1* 


1:10 2 


1-.10 4 


1:10 6 


1:108 


Round 1 Phage A 


16 (14*) 


24 


20 


0 


nd 


Phage B 


8 (10*) 


0 


4 


24 


nd 


Enrichment 






1.6X10 4 




nd 


Round 2 Phage A 


nd 


nd 


nd 


24 


0 


Phage B 


nd 


nd 


nd 


12 


36 


Enrichment 


nd 


nd 


nd 


1. 3x10 s 





10 
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Table 3 Selection of villin. 

Mixtures of villin-phage and pKl rescued with KM 13 in ratios from 1:1 to 1:10^ were 

selected by proteolysis at 10°C, 24 phage clones analysed and number of each noted 
5 above. a before selection. 

villin-phage : pKl 

1:1 1:10 2 1:10 4 1:10 6 

"pKl 0 (16 c ) 6 7 24 

Villin 24 (8 G ) 24 17 . 0 

Enrichment - - 8 7x10^ 



10 



15 



20 



25 



24 

Table 4. Primer sequences 



pkl inker 

polyXaf or 

polyXaback 

fdPCRBack 

LI3S SQf or 

LIBSHQback 

LSPAfor 

LSPAback 

Flagprimer 

RECGLYfor 

RECGLYback 



delCKpn 
Barnasef or 

3arnaseH102Aba 
ck 

villinf or 
villinback 



5 * GGCACCCTCAGAACGGTACCCCACCCTCAGAGGCCGGCTGGGCCGC 
CACCCTCAGAG 3' 

5 ■ GGTGGCGGCCCAGCCGGCCTTTCTGAGGGGTCGACTATAGAAGGAC 
GAGGGCCCAGCGAAGGAGGTGGGGTACCCCCTTCTGAGGGTGG 3 • 
5 ' CCACCCTCAGAAGGGGGTACCCCACCTCCTTCGCTGGGCCCTCGTC 
CTTCTATAGTCGACCCCTCAGAAAGGCCGGCTGGGCCGCCACC 3 ' 
5 1 GCGATGGTTGTTGTCATTGTCGGC 3 ' 
5 » AAAAGAAACGCAAAGACACCACGG 3 ' 
5 ' CCTCCTGAGTACGGTGATAGACC 3 ' 
5 ' GTAAATTCAGAGACTGCGCTTTCC 3 1 
5 1 ATTTTCGGTCATAGCCCCCTTATTAG 3 * 

5 ' CAAACGGGCGGCCGCAGACTACAAGGATGACGACGACAAGGAAACT 
GTTGAAAGTTGTTTAGCAA 3 1 

5 ' CCCCTCAGAAAGGCCGGCTGGGCCGCCGCCAGCATTGACAGGAGGT 
TCAGG 3 ' 

5 • GAAGGAGGTGGGGTACCCGGTTCCGAGGGTGGTTCCGGTTCCGGTG 
AT 

TTTG 3 ' 

5 ' CCCTCGGAACCGGTACCCCAGCTGCTTCGTGGGCCC 3 ' 

5 ' CTGGCGGCGGCCCAGCCGGCCCTGCACAGGTTATCAACACGTTTGA 
C 3 ' 

5 ' CTCGGAACCGGTACCTCTGATTTTTGTAAAGGTCTGATAAGCG 3 ' 

5 ' GGCGGCCCAGCCGGCCTTTCTCTCTCTGACGAGGACTTCAAGGC 
3 ' 

5 ' CCTCGGAACCGGTACCGAAGAGTCCTTTCTCCTTCTTGAGG 3 ' 



Materials & Methods (Examples 1-6) 



Materials 

5 All restriction enzymes, T4 ligase are obtained from New England Biolabs. Taq DNA 
polymerase is obtained from HT Biotechnology. Pfu DNA polymerase is obtained from 
Stratagene. Ultrapure dNTP from Pharmacia. Proteases and the protease inhibitor 
Pefabloc are obtained from Boehringer Mannheim, except chy mo trypsin and trypsin 
TPCK-treated which are obtained from Sigma. All other chemical are likewise obtained 
10 from Sigma. 

Phage preparation 

Escherichia coli TGI [42] is used for cloning and propagation of phage. TGI 
15 harbouring fd-DOG [43] or derivatives is grown overnight in 2xTY containing 15 
jig/ml tetracycline. Phagemids are rescued using KM13 or VCSM13 as described [27]. 
Phage particles are prepared by two PEG precipitations [44]. 

Vector construction 

20 

The phage vector fd-DOG [43] is used as parent vector for coastruction of the protease 
cleavable fd-K108. Unique restriction sites (Sfil, Kpnl) are introduced into the glycine 
rich spacer region between D2 and D3 using the Sculptor in vitro mutagenesis system 
(Amersham) and the oligonucleotide pklinker (Table 4). Further restriction sites (Apal, 
25 Sail) and sequence encoding a protease cleavage site are cloned between the Sfil and 
Kpnl sites using the oligonucleotides polyXafor and polyXaback to create the vector fd- 
K108. 

The protease cleavable helper phage KM13 is prepared from fd-K108 by transplanting 
30 into the helper phage VCSM13 a BamHl-Clal fragment generated by PCR and primers 
fdPCRBack and LIBSEQfor. 
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A protease cleavable phagemid vector is derived from fd-K108 much as above except 
using pCANTAB 3 (Pharmacia). A FL AG-tag is introduced at the N-terminus of Dl by 
cloning of a Notl-Sfil fragment generated by PCR and primers Flagprimer and 
LSPAback. To circumvent deletions due to repeated sequence in the D2-D3 linker, the 
5 codon usage of the polylinker region is changed in two steps (a) using a Bam-Sfil 
fragment generated by PCR and primers RECGLYfor and LIBSEQfor, screening 
recombinants by PCR and the primers LSPAfor and LSPAback, (b) using a KpnI-CIal 
fragment generated by PCR and the primers RECGLYback and LIBSEQback, screening 
recombinants using LSPAfor and LSPAback. The resulting vector is pKL The entire 
10 p3 gene is sequenced using PCR cycle sequencing with fluorescent dideoxy chain 
terminators (Applied Biosy stems) [45]. The "out of frame" vector pK2 is derived from 
pKl by site direct mutagenesis using the oligo delCKpn and the Sculptor Amersham kit. 
The precise sequences of pKl and pK2 are set forth in Kristensen et ai, (1998) Folding 
& Design 3: 321-328. 

15 

Cloning of Barnase and Villin 

The vectors encoding the single barnase mutants, Hisl02->Ala and Leul4-> Ala [29, 
46] are used as templates for PCR amplification with primers Barnasefor and 
20 BarnaseH 102 Aback and Pfu polymerase. The PCR products (encoding the single mutant 
His 102- > Ala, and the double mutant HisI02->Ala T Leu 14- > Ala) are digested using 
the restriction enzymes Sfil and Kpnl, and ligated into vector pK2 to give the 
phagemids pK2BA and pK2BB respectively and the barnase genes sequenced using PCR 
cycle sequencing. 

25 

The 35 amino acid thermostable fragment of the headpiece of the f-actin binding protein 
villin [33] is amplified from chicken bursa cDNA using PCR primers villinfor and 
villinback with Pfu polymerase. The PCR products are cloned as above to give the 
phagemid pK2V. 

30 
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Resistance of phages to denaturants, pH and proteases 

For resistance to denaturants, 10 M urea in PBS (25 mM NaH2P04, 125 mM NaCl pH 
7.0) or 8 M GndHCl (Guanidine hydrochloride) and 50 mM Tris-HCl pH 7.4, 1 mM 

5 CaC12 (buffer A) is added to 10 pi phage stocks (10 8 -10 10 TU) to give a volume of 1 
ml and the conditions specified in Table 1. The phage are incubated for 1-2 hrs, then 
100 pi aliquot added to 1 ml TGI (OD600 -0.5) and serial dilutions plated on TYE 
plates with 15 pglrrl tetracycline. For resistance of phage to extremes of pH (2-12), 
Tris glycine or Tris HC1 buffers (0.1 M glycine or 0.1 M Tris respectively) are added 

10 to 10 pi phage stocks, and to neutralise each 100 p\ aliquot we added 50 pi 1 M Tris- 
HCl pH 7.4 before infection. For resistance to temperature, buffer A is added to 10 pi 
phage stocks to give a volume of 1 ml and incubated at a given temperature (20 - 60 C) 
for 1 hr. 100 pi aliquots are added to TGI and plated as above. For resistance to 
proteases, 100 mM NaCl, 50 mM Tris-HCl, 1 mM CaCb pH 7.4 (Factor Xa 100 ng/ml 

15 or trypsin, chymotrypsin, thrombin, thermolysin and subtilisin all 100 /xg/ml) or 50 
mM Tris-HCl, 1 mM EDTA pH 7.4 (IgA Protease 10 ng/ml) or 50 mM NH4CO3 pH 
8.0 (Arg-C 100 Mg/ml, Glu-C 100 pg/ml) or 25 mM NaHiPCM, 125 mM NaCl pH 7.0 
(AspN 40 ng/ml) is added to 10 pi phage stocks (fd-DOG and fd-K108) to give a 
volume of 100 p\ and a final concentration of protease as indicated. Digestions are 

20 incubated for 15 min at room temperature, samples (100 pi) are then infected into TGI 
as above. 

For resistance to proteases in the presence of denaturants samples are prepared as above 
for urea and temperature denaturation. To 90 pi aliquots 10 pi trypsin (1 mg/ml) is 
25 added, after 5 min at room temperature 4 pi Pefabloc (100 mM) is added and the 
samples are infected into TGI as above. 

Western blot 

30 Phages (pHENl rescued using KM13 and pKl rescued using VCSM13) are subjected to 
SDS-PAGE [47] before or after cleavage by trypsin (50 ng/ml). After semidry transfer 
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to PVDF membranes the filter is process essentially as described [27]. The primary 
antibody, monoclonal anti-gill (MoBiTec), is added in a 1:5000 dilution followed by 
anti-mouse HRP-conjugated antibody (Sigma) in a dilution of 1:50000. Finally the filter 
is developed using the luminol based Chemiluminescence Western Blotting kit 
5 (Boehringer Mannheim). 

ELISA 

Phage displaying barnase mutants are analysis for binding to the RNase inhibitor barstar 

10 as described [44]. 10 pmol biotinylated barstar is mixed with approximately 10^0 phage 
displaying barnase mutant A or barnase mutant B or villin or buffer A. Phage binding 
barstar is captured on Streptavidin coated plates (Boehringer Mannheim) and developed 
using HRP conjugated anti-M13 antibody (Pharmacia) and 2,2'-Azino-Bis(3- 
Ethylbenzthiazoline-6-sulfonic acid) (Sigma). Absorbance readings are taken at 405 nm. 

15 

Temperature denaturation 

At each temperature approximately 10^ phage displaying the barnase mutants or villin 
(ampicillin resistant) is mixed with a cleavable control fd-K108 (tetracycline resistant), 

20 and a non-cleavable control phagemid, a chloramphenicol resistant derivative of 
pHENl, rescued with KM 13 in a total volume of 90 pi of buffer A. After equilibration 
for 20-30 min at the temperature indicated, 10 pi trypsin (5 /xg/ml) is added and the 
incubation continued for 2 min. Trypsin is neutralised by adding 4 pi 100 mM Pefabloc. 
Infection and serial dilution is performed in TG-1 as above and aliquots are plated on 

25 TYE plates containing 100 jig/ml ampicillin + 1 % glucose, 30 pg/ml chloramphenicol 
+ 1 % glucose or 15 pg/m\ tetracycline. 

Selection experiments 

30 10 pi of serial dilutions of the barnase mutant phage A is mixed with 10 pi of the non- 
diluted barnase mutant phage B in 70 pi buffer A. After 30 min incubation at 20°C or 
37 °C 10 pi trypsin (5 /zg/ml) is added. Following 2 min. of digestion 4 pi Pefabloc 
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(100 mM) is added. The phage are infected into TGI as above. A second round of 
selection are performed by scraping bacteria in 3 ml 2xTY, 50 p\ inoculated into 50 ml 
2xTY/Amp/Glu and the phagemid rescued and phage prepared as above. Clones are 
analysed by PCR using the primers LSPAfor and LSPAback followed by restriction 
5 digestion using Ddel. 

Selections between pK2V and pKl phage particles are performed as above, except the 
selection is performed at 10°C. Clones are analysed by PCR using the primers LSPAfor 
and LSPAback. 

10 

Example 7. Use of protease-cleavable helper phage for selection of signal 
sequences. 

Translocation of proteins is directed by signal peptides [48]. These are known to share 
15 common features such as a positively charged amino-terminal region, a hydrophobic 
sequence and a carboxy-terminal region including the signal peptidase cleavage site. 
Signal peptides are involved in 11 an array of protein-protein and protein-lipid 
interactions" [49]. The signal sequence may in addition interfere with the protein 
folding pathway. They are also involved in translational regulation, mainly through the 
20 downstream box. 

Enzymes such as DNA polymerase are very poorly displayed on phage particles, 
making their selection almost impossible. In order to improve the capabilities of phage 
display to select polymerases, therefore, improved signal sequences are designed and 
25 selected for using a phage display technique in which "empty" phage background is 
eliminated by digestion of helper phage. 

Design of a signal sequence for optimal polymerase display on phage is not easily 
achieved. A selection strategy is therefore devised to isolate signal sequences from a 
30 library where mutations are introduced at selected sites. Although the signal sequence is 
not present on phage, its sequence is easily retrieved by sequencing the phagemid 
located within the phage particle. 
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Two libraries are generated from pelB and g3 leader sequences, making use of the 
following oligonucleotide primers for PCR amplification: 

1: TACGCCAAGCTTGCATGC; 
2: CTGCACCTGGGCCATGG; 
3: GATTACGCCAAGCTTTG 

4:GATTACGCCA4GC7TGCATGCANNDDCTNTDTCAAGGAGACAGTCATAATG 
ARRNNBCTATTGSYAAYRSYASYASYAGBNTTGTTATTACTCSYANYCVNNCY 
GDCC47GGCCCAGGTGCAGCTG; 

5:GATTACGCCA4GC77TGNNNNCTTTTTTWGGAGATTTTCAACRTGAMAR 

ATTATTATTCSYAATTSYTTTAGTTSYTSYTTTCTWTGYGGYCCAGCCGGCG47 

GGCCC AGGTGCA . 

6: CTTTATGCTTCCGGCTCG. 

7: CGGCCCC ATTC AG ATCC . 

The restriction sites Hindlll and Ncol are noted in italics. 

Library I deriving from the pelB leader and library II deriving from the g3 leader are 
prepared by PCR amplification of 4 (pelB) amplified with 1 and 2 and of 5 (g3) 
amplified with 3 and 2 respectively. Each PCR product is digested with Hindlll and 
Ncol and purified with a gel extraction kit (Qiaquick, Qiagen). 0.2 jig of each resulting 
insert is mixed for ligation with about 2 ug of pHENl-Stoffel vector (Fig. 6) previously 
digested with Hindlll and Ncol and dephosphorylated with alkaline phosphatase 
(Boehxinger Mannheim). The ligation mixture is purified by phenol-chloroform 
extraction and ethanol precipitation prior to electroporation into freshly prepared E. coli 
TGI. 

Randomisation of 32 and 20 bases for the pelB and g3 leaders respectively is carried 
out: (i) near and within the Epsilon sequence just upstream the Shine-Delgarno sequence 

(ii) downstream the Shine-Delgarno sequence near and within the Downstream box [50] 

(iii) within the leader peptide at the N-terminal region containing the positively charged 
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amino acid residues, in the hydrophobic region [51] and in the C -terminal region close 
to the highly conserved peptidase cleavage site [52]. 

Phage is produced as described previously [25] except that the helper phage KM 13 
5 (Example 3) is used instead of VCSM13 and that 0. 1 mM IPTG is added when specified 
to the overnight culture at 30°C. Selections for resistance to trypsin and for binding 
[44] is done as described earlier, except that 11 jig of anti-Taq antibody (Taqstart, 
Clontech) is coated overnight on immunotubes (Nunc). PCR screening is done with 
primers 6 and 7 using single E.coli TGI colonies containing the phagemid as template 
10 following a previously described protocol; after gel electrophoresis on a 2% agarose 
gel. The size of the amplified fragment is used as a criterion to establish whether 
deletions within the polymerase gene have occurred. 

The calculated diversity of the libraries computed from the degeneracy of the 
15 synthesised oligonucleotides is about 3.7xl0 13 = 4 9 x3 7 x2 16 and 1.7xl0 7 = 4 4 x2 16 for 
the pelB and g3 leaders respectively. After transformation of E.coli with the phagemid 
libraries, the library size measured as the number of ampicillin-resistant colonies is 

found to be 1.3xl0 7 and 9.6xl0 6 for the pelB and g3 leaders respectively. 

20 The selection for display of the polymerase is done by cleaving specifically the helper 
phage p3 copies with the protease trypsin so as to render non-infective all phage 
panicles that are not expressing any p3-polymerase fusion protein. 

Both libraries are mixed and the selection rounds are carried out in two conditions, with 
25 or without 0.1 mM IPTG in the culture medium. With IPTG, deletions of the 
polymerase gene or of part of it are noticed after round III (4 out of 28 clones) as shown 
by a PCR screening (see above); after round IV, these clones represent most of the 
population (28 out of 30). Without IPTG, these clones represent a significant part of the 
selected ones after round VI (3 out of 12). The selection is therefore changed from 
30 round five on by introducing in addition to selection for protease resistance, a selection 
for binding to an anti-Taq antibody. After the selection rounds VII and VIII, (3 out of 
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13) and (0 out of 19) clones respectively correspond to deleted p3-polymerase fusion 
proteins. 

For characterisation of the leaders, the HmdIII-Ncol fragments are subcloned after PCR 
5 amplification of individual E.coli colonies. The resulting phagemids are noted pHENl- 
Ix/Stoffel subcloning with x=7,9,10 and 12. The HindDI-Ncol and the NcoI-NotI 
fragment corresponding to the Stoffel fragment is sequenced on both strands using a 
373 A DNA sequencer (Applied Biosystems). 

10 For ELISA, an anti-Taq antibody (Taqstart, Clontech) is used for coating the ELISA 
plate and an anti-M13-horseradish peroxidase fusion protein (Pharmacia Biotech) is 
used for detection in a standard protocol [44]. 

Expression of polymerase in the supernatant is made by infection of E.coli HB2151 
15 with selected phagemids [25, 27] except that the IPTG concentration is 0.1 mM instead 
of 1 mM. About 10 jil of supernatant is loaded on a polyacrylamide gel for 
electrophoresis (Novex); the gel is blotted onto nitrocellulose (Protran, Schleicher and 
Schuell) and an anti-Taq polymerase antibody (Taqstart, Clontech), and a goat anti- 
mouse IgG-horseradish peroxidase (Sigma) prior to detection on autoradiography films 
20 by chemiluminescence (ECL reagents, Amersham). 

Four individual clones 7, 9, 10 and 12 from round VII, that were screened for protease 
resistance among 12 clones, are further characterised. To ensure that only mutations 
within the signal sequence are considered, and not mutations somewhere else within the 

25 phagemid that may have occurred during amplification at the various rounds, the signal 
sequences are subcloned into the original vector. The results shown in Table 5 indicate 
that optimal polymerase display for the selected clone 10 is about 50 fold higher than 
for the original sequence. This result is confirmed independently within experimental 
errors by an ELISA using anti-Taq antibody and anti-M13-HRP: 10 9 phage particles of 

30 the pelB leader phagemid pHENl -Stoffel (signal to noise ratio: 1.46) give an identical 
signal as 10 7 phage particles of clone 10 (signal to noise ratio: 1.47). 
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The expression of Stoffel fragment in E.coli HB2151 is also studied for the various 
leaders (see Table 6). The concentration of Stoffel fragment in the culture supernatant 
is estimated by comparing the spot intensities for known amounts of polymerase and 
found to be about 0.1 mg/1 for the pelB leader. A 3-fold increase in expression is 
5 observed for the leaders 17 and 110, whereas an about 3-fold decrease is noted for leader 
19. 

Table 5. Number of phage-polymerases per phage particle with leader pelB, as a 
function of temperature and IPTG concentrations. 

10 

The phage titer is measured as the number of infective phage particles and the phage- 
polymerase titer as the number of infective phage particles after treatment with trypsin. 
The number of phage-polymerases per phage panicle is the ratio of the titers. As the 
phage particles were rescued with a helper phage, phage displays either a p3- 
15 polymerase fusion protein and a few p3 copies containing a trypsin-cleavage site or only 
these p3 copies. 



Temperature 25°C 30°C 37°C 

20 0 mM IPTG 1.7x10-3 9.1xl0" 4 1.2xl0' 3 

0.1 mM IPTG 9.1x10-3 * 8.3xl0" 3 2.4x10-3 

1 mM IPTG 1.5x10-2* 5.5xl0"3 * 1.2xl0"3 



25 



in these conditions, the phage titer drops below 10 10 /ml of culture medium. 
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Table 6. Phage characterisation for leader pelB or selected leaders from round 
VII for optimal polymerase display (same legend as for Table 5; culture in 2xTY at 
T=30°C without IPTG). 

Leader Titer Number of phage-polymerases 

x 1 0 U per phage particle 



pelB 


1.2 


9.1xl0- 4 


17 


2.0 


2.5xl0- 2 


19 


0.5 


8.3x10-3 


110 


0.7 


4.3xl0- 2 


112 


1.6 


1.4x10-2 
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Table 7. Randomised and selected sequences. 

The randomised DNA sequence is given from 5' to 3'; above and below it, the bases 
that differ from the given sequence in the signal sequences pelB, 17, 19, 110 and 112 are 
5 indicated. The Shine-Delgarno sequence, the start codon and the last codon of the signal 
sequence, GCC, have been underlined. The Hindlll and the Ncol restriction sites are in 
italics. The corresponding amino acid sequences are given below. Library I is initially 
designed from the pelB leader and library II from the g3 leader. 

10 III-A. From library I 

?el3 AATT A T AATAC 

5 ! AA GC TIu C AT GC ANNDD CTNT DTCAAGGAGACAGTCATAATGARRNN3 CT 





17 


GCAT 


C 


G 


AGACG 


15 


110 


CGGG 




T 


GAGGG 




112 


CCAC- 


c 


T 





pelB CCT CGGC GGCGCT GA GCGGC CAG C G 

20 ATTGS YAAYRS YAS YAS YAGBNTTGTTATTACTC SYANY CVNNCYGD CCATGG 

CC 3 1 

17 GC TGGT CT GT GA CC CC GGT C T 

-10 GC TGCT GT GC GG CC AT GCG C G 

112 GT TAGC GC GT GG CT GC CCC C A 



25 



pelB MKYLLPTAAAGLLLLAAQPAMA 

17 K T AMVLVG ??G?S 

HO R G A M L V A G ? I A P A 

112 RR V I A A V G LAPPET 



30 
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III-B. From library II 



g3 leader GAGC TT G A A 

5 ' AAGCriTGNNMTCTTTTTT WWGGAG ATTTTCAACRTGARAARATTATTAT 
19 GGGC TA A G G 

GC CC GT CC A C C 

TCSYAATTSYTTTAGTTSYTSYTTTCTWTGYGGYCCAGCCGGCCATGG CC3 1 

19 CT CC GT GC A T T 

g3 leader MKKLLFAI P L V V ? ? YAAQPAMA 
19 R R LP V A Y V V 

Example 8. Selection of a catalytic activity using protease-cleavable helper phage. 

A strategy for the selection of catalysts by phage display is based on selection of the 
reaction product of a calatytic reaction, and the use of proximity effects to select the 
catalyst. In this strategy, a tagged substrate is crosslinked to the phage in the proximity 
of the displayed enzyme; the phage is thereby attached to a solid-phase and released by 
an intramolecular cleavage reaction catalysed by the displayed enzyme [53]. 

A similar approach has been applied to the selection of active DNA polymerase 
variants. The approach involves two chemically independent reactions, the catalytic 
reaction leading to a product (in this case distinguished by incorporation of a biotin tag) 
and a chemical crosslinking reaction by which 'the substrate (and product) are linked to 
the phage. Selection of the phage by streptavidin beads therefore selects for phages 
which are chemically attached to tagged product; these reactions are more likely to be 
coupled on the same phage as reactions in cis are favoured over reactions in trans by 
proximity. 

Maleimides are used in a chemical cross-linking reaction. These are known to react 
with thiols and in alkaline solutions with amino groups, and are therefore capable of 
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reacting with a wide range of sites on the phage and on the displayed enzyme. A 
covalent product between the major coat protein (p8) and N-biotinoyl-N r -(6- 
maleimidohexanoyl) hydrazide, is detected by SELDI mass spectrometry. Two amino 
groups (the N-terminal Ala-1 and the residue Lys-8) are thought to be involved. 

5 

The strategy is tested using DNA polymerases in view of their central role in molecular 
evolution. A maleimidyl group is introduced at the 5' end of a DNA primer; the 
product is tagged by addition of biotinylated dUTP to the 3* end of the primer by the 
catalytic action of the polymerase. The Klenow and Stoffel fragments of DNA 
10 polymerase I Escherichia coli and Thermus aquaticus, respectively, are cloned for 
display by fusion to the pill coat protein of filamentous bacteriophage by conventional 
techniques. Both fragments lack the 5' to 3' exonuclease domains; the Stoffel fragment 
also lacks a 3' to 5' exonuclease activity. 

15 The fusion protein is cloned on a phagemid (pHENl) [25], and is rescued by a helper 
phage. The polymerase fragments are shown to be displayed on the phage (after rescue 
with helper phage) by binding of the phage to wells coated with anti-polymerase 
antibodies as detected by ELISA (not shown). The phage are also analysed by Western 
blot using anti-p3 or anti-polymerase antibodies. This confirms the presence of the 

20 fusion protein, but also indicates contamination by free polymerase. Presumably this 
arises by secretion from the bacterial host through incomplete suppression of the amber 
stop codon or by cleavage from the phage surface. This is removed by a further step of 
ultracentrifugation or by size exclusion chromatography. The purified phages are 
assayed for DNA polymerase activity in a primer/template extension assay with 

25 radioactively labelled a32p-dCTP and found to be active. 

However as is indicated by the Western blots, the polymerase-p3 fusion protein is 
poorly incorporated into the phage compared to the p3 protein. This appears to be due 
to incorporation of p3 from the helper phage, as shown by the alternative use of a 
30 helper phage (KM 13) in which the p3 protein of the helper phage (but not that of fusion 
protein) can be cleaved with trypsin so as to render it incapable of mediating infection 
(Examples 3 and 4). Thus after proteolysis only those phages that had incorporated the 
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fusion protein are infective; from the loss in titre after proteolysis we estimate that only 
one phage particle in a thousand had incorporated the fusion protein. The selection 
process, relying on tagging by polymerase in cis, would be compromised by such a 
great excess of phages lacking the polymerase but available for tagging in trans. 
5 Selected phages are therefore treated with trypsin to destroy the infectivity of those 
lacking the displayed polymerase. 

The phage displaying the Stoffel fragment are incubated with primer 13 [TTT CGC 
AAG ATG TGG CGT] comprising a 5' maleimidyl group and a 3* biotinylated 

lo nucleotide. After incubation the phage are captured on streptavidin-coated beads f with a 
yield of about 1 - 5 % of infectious phage. This shows that primer can be chemically 
cross-linked to the phage, presumably via p8 protein as shown for the N-biotinoyi-N'- 
(6-maleimidohexanoyl) hydrazide. The phage are then incubated with primer lb 
[GCGAAGATGTGG] comprising a 5' maleimidyl group in the presence of biotin-dUTP 2 

15 and template 3 [AAA TAC AAC aat AAA ACG CCA cat ctt GCG], Capture of 
the phage is dependent on presence of lb, 2 and 3 (Table 8), but also on the inclusion 
of trypsin, which cleaves the helper phage to reduce non-specific phage isolation. 
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Table 8. Selection of catalytically active phage-Stoffel particles. 

[a] ^7Ib] Yield Cond::icns 

in tu in tu in % 



s 


. 4xi0 5 


2 


. OxlO 4 


2 


.4 




3 


. 6X10 5 




. OxlO 2 


0 


. 028 


- primer lb 




. 4xl0 5 


3 


. OxlO 2 


0 


.063 


- biotzinylated dUTP 2 


4 


. 8xl0 5 


3 


. OxlO 2 


0 


.062 


- template 3 


4 


.4x10 s 


4 


. OxlO 6 


0 


.091 


- trypsin 


T 
X 


.5x10^ 




.5xl0 5 


0 


.037 


- trypsin, - primer 1 



cpi and of denote the number of transformation units (tu) prior [a] and after 
selection. Yield = cpf / (pi. 

[c]: + primer lb, + biotinylated dUTP 2, + template 3 and + trypsin. 

Example 9. Selection for disulphide-containing polypeptides. 

For the cloning of (polypeptide encoding DNA fragments and their display for s 
between barnase and p3, the phage fd-3 is constructed (Fig. 5). Phage fd-3 comp 
H102A mutant of barnase N-terminally fused to the p3 gene of phage fd-TET. I 
the codon for the last residue of barnase and the first residue of p3 is the ni 
sequence CTG CAG GCG GTG CGG CCG CA. This sequence contains a P< 
restriction site (in italics) for insertion of DNA fragments flanked by PstI restnct 
The sequence further introduces a frame shift between barnase and p3 ; which 
expression of the correct p3 reading frame in fd-3. Phage particles of phage fd-3 
do not display the infection protein p3 and are non-infectious. 

Phage fd-3 is therefore well suited as a cloning vector as vectors without F 
inserts after ligation are not propagated during selection. Statistically 1 out of 
DNA inserts in the Pstl restriction site will (except for the presence of stc 
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within the insert) create an open reading frame spanning barnase, the insert itself and p3 
and result in infectious phage particles containing p3 in the phage coat. In these 
recombinant clones barnase is followed by the insert, which is then followed by the 
amino acid residues AGGAAA before the start of the p3 protein. This AGGAAA 
5 peptide should provide enough flexibility in the fusion protein to enable the infectivity 
function of p3 and the access of the protease to the N-terminal appendices of p3. 

Genomic DNA from the E.coli strain TGI is amplified in 30 cycles of a polymerase 
chain reaction (PCR) with an annealing temperature of 48°C using the oligonucleotide 

10 SN6MIX (5 '-GAG CCT GCA GAG CTC AGG NNN NNN-3'), which comprises 6 
degenerate positions at the extendible 3' end to ensure random priming. In a second step 
of 30 PCR cycles with an annealing temperature of 52°C primary PCR products are 
extended by re-amplification with the oligonucleotide XTND (5'-CGT GCG AGC CTG 
CAG AGC TCA GG-3*)- Products with a length of around 150 bp from this reaction 

15 are purified from an agarose gel and reamplified in 30 PCR cycles using an annealing 
temperature of 52°C and the oligonucleotide XTND. These reamplified 150 bp 
fragments are partially digested with Sad (site indicated in bold in the oligonucleotides) 
and ligated for dimerisation. Ligated products are reamplified in a further 10 PCR 
cycles with an annealing temperature of 44°C followed by a 30 PCR cycles with an 

20 annealing temperature of 55°C using the oligonucleotide XTND. The annealing 
temperatures are chosen to discriminate against priming of the oligonucleotide in the 
middle of the dimerised fragments. The reaction product is size purified twice on an 
agarose gel to remove monomers and oligomers (non-dimers). 



The final dimer fraction is amplified by PCR using an annealing temperature of 55°C 
and the oligonucleotide XTND on a large scale, digested with PstI (site indicated in 
italics in oligonucleotides) and ligated into the also PstI digested and phosphatased 
vector fd-3. After electroporation into E.coli bacteria a repertoire of 3.6xl0 7 
recombinants is obtained. Sequence analysis of randomly picked clones reveal the 
presence of mainly dimeric (11 out of 14) and some monomeric (2 out of 14) DNA 
inserts. 
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Reinfection of E.colt bacteria with phage produced from the initial population of 
transformed cells yields, according to sequence analysis of twenty randomly picked 
clones, a library of infectious phages containing almost exclusively barnase-(in-frame, 
no-stop-dimer-insert)-p3 fusions. Non-infectious phages arising from vector without 
5 insert and from vector with out-of-frame or stop-codon containing inserts are not 
propagated in the infection step. The vector fd-3 is therefore suitable to create a 
repertoire of polypeptides randomly generated through dimerisation of DNA fragments 
from the E.coli genome. 

10 This repertoire of polypeptides displayed as an inserted fusion between barnase and p3 
on phage fd is subjected to proteolytic digestion with trypsin and thermolysin alone or a 
mixture of both (1 ng/jil each) in TBS-Ca buffer (25 mM Tris, 137 mM NaCl, ImM 
CaCb, pH 7.4). After proteolysis phage is captured with biotinylated barstar bound to a 
Streptavidin coated microtitre well plate and eluted at pH 2.0. Phage is neutralised to 

15 pH 7 and propagated through reinfection of the Exoli cells and selected for a second 
round as before. All steps are performed in the absence of any reducing agent like 
dithiothreitol (DTT) or p-mercaptoethanol, which would reduce and thereby cleave any 
potential disulphide bonds formed between cysteine sidechains within selected 
polypeptides. 

20 

Randomly picked phages, which are eluted after the first and second round of 
proteolytic selection, are analysed for binding to barstar after incubation with a mixture 
of trypsin and thermolysin under the conditions of the selection (Table 9). Their 
sequence is determined (Table 9). 17 out of 18 analysed clones treated with a mixture of 

25 trypsin and thermolysin during selection are found to bind biotinylated barstar after 
incubation with trypsin and thermolysin. 5 out of 8 analysed clones treated with trypsin 
during selection bound biotinylated barstar after incubation with trypsin and 
thermolysin, whilst 9 of 14 analysed clones treated with thermolysin are found to bind. 
No randomly picked clones not treated with a protease during selection bind 

30 biotinylated barstar after incubation with trypsin and thermolysin. Binding to 
biotinylated barstar shows that the polypeptide insertion between barnase and p3 on the 
phage retains its overall covalent integrity and therefore keeps the N-terminal tag 



(barnase) and the infection protein p3 (and thereby the phage particle as a whole) 
covalently linked. 

However, the possibility of proteolytic digestion of the peptide backbone can not be 
excluded, as the inserts may also be kept covalently linked through bonds between 
sidechain groups like the SHb groups of cysteines. Sequence analysis of the selected 

clones reveal that 19 of 25 (76%) resistant clones contain two or more cysteine 
residues. 

To analyse the role of possible disulphide bonds in the polypeptide inserts, 13 of the 
selected phages are analysed for binding to biotinylated barstar (and thereby for a 
covalent linkage of barnase and p3 through the insert) after treatment with trypsin and 
thermolysin followed by a wash with 20 mM DTT before detection of bound phage. 2 
phage clones, which bind barstar after protease treatment without DTT wash and 
contain no cysteines, are unaffected by the DTT treatment. Another 2 phage clones, 
which bind barstar after protease treatment without DTT and contain two or more 
cysteines, are also observed to bind barstar after protease and DTT treatment. This 
suggests that their inserts are protected from proteolysis of their peptide backbone 
despite the presence of principle substrate sites for the proteases. 

These inserts are therefore protected from proteolytic attack due to a conformational 
restraint of their peptide backbone. However, 9 phage clones which bind barstar after 
protease treatment without DTT wash and contain two or more cysteines are observed 
not to bind barstar after protease and DTT treatment. This suggests that their inserts are 
proteolytically cleaved in their peptide backbone, but are held together by disulphide 
bonds between cysteine sidechains in the absence of the reducing agent DTT. These 
inserts are therefore not protected from proteolytic attack due to a conformational 
restraint of their peptide backbone. 

Thus, the method of the present invention may be configured to select for cysteine- 
containing polypeptides, even where the polypeptides would by susceptible to protease 
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attack since the polypeptides are capable of being held together in the selection step by 
disulphide bonds. 

Table 9. Barstar binding of phages displaying barnase-p3 fusion inserts selected after 
5 proteolytic treatment under non-reducing conditions and amino acid sequences of 
their PstI inserts in vector fd-3. 

Barstar binding (-DTT) after proteolysis of phage with trypsin and thermolysin (2ng/ul 
, each) without a 20 mM DTT wash is determined by measurement of a signal that is at 

10 least 60% of the signal for barstar binding of phage without protease treatment. Barstar 
binding (+DTT) after a 20 mM wash is determined by measurement of a signal that is at 
least 60% of the signal for barstar binding (-DTT) without a 20 mM DTT wash. Phage 
are randomly picked after one (lx) or two (2x) rounds of proteolysis with trypsin (Tr) or 
thermolysin (Th). Phages are treated with proteases, captured with biotinylated barstar in 

15 microtitre well plates, and washed with 20 mM DTT where applicable. Bound phage are 
detected with a horse radish peroxidase conjugated anti-M13 phage antibody (Pharmacia 
Biotech). 
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Phage Proteolytic Barstarbindg Amino acid sequence 

clone selection -DTT +DTT of inserts 



TA-1.2 lxTr 



TA-2.25 2xTr 



TA-2.2 6 2xT> 



10 • TA-2.2 7 2xT: 



TA-2.3 0 2xTr 



15 



TB-1.10 lxTh 

TB-1.11 lxTh 

TB-2.3 3 2xTh 

TB-2.34 2xTh 

TB-2.3 5 2xTh 

TB-2.37 2xTh 



25 T3-2.3 8 2xTh 
TB-2.39 2xTh 
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yes no LQSSGDCVIS DTCIAGMAEA AACEEKFSSQ 

NVGLTITVTP CLSSA 
yes no LQSSGCGSSG SSINCLPCGA TSRGTSPLAS 

GLPSSATIHC LSSA 
yes no LQSSGDSAGC KNMTGGRLYA KTLEAIIPG? 

AVSAPACEPA 

yes yes LQSSGCVRLK RTSVNHQPDA WPEPHLKAAC 
EPA 

yes no LQSSGCGSSG SSINCLPCGA TSRGTSPLAS 

GLPSSATVQC LSSA 
yes yes LQSSGKIVQA GANIQDGCIM HGYCDTDTIV 

GENGKIGLSS A 
yes yes no insert, Barnase & p3 in frame 
yes no LQSSGVCVIS DTCIAGTAEA AACEEKFSSQ 

NVGHTITETP CLSSA 
yes no LQSSGCGSSG SSINCLPCGA TSRGTSPLAS 

GLPSSATIQC LSSA 
yes no LQSSGQDSQR EHASHTAEDD CEDQTRIHQH 

IREVDF VDTP QEVDDCRAAL SSA 
yes no LQSSGCVRLK RTSVNHQPDA WPEPHLKAAC 

EPA 

yes yes LQSSGVRPA 

yes no LQSSGCGSS GSSINCLPCGA TSRGTSPLAS 
GLPSSATIQ CLSSA 
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Example 10. Use of reducing agents to eliminate disulphide-related background. 

The repertoire of polypeptides described in Example 9 is digested as before with both 
trypsin and thennolysin (Example 9) except for an additional washing step here with 50 
5 mM DTT after binding of the proteolytically treated phage to the microtitre well plates 
coated with Streptavidin-biotinylated barstar. This washing step is designed to wash off 
phage displaying an N-terminal barnase tag, which is no longer linked to p3 through an 
intact polypeptide backbone but only through disulphide bonds between cysteine 
. sidechains in the polypeptide inserts between the barnase tag and p3. 

10 

12 randomly picked phages eluted after the second round of proteolysis are analysed for 
stability against a mixture of trypsin and thermolysin under the conditions of the 
selection and their sequence is determined (Table 10). 10 clones treated with a mixture 
of trypsin and thermolysin during selection bind biotinylated barstar after incubation 
15 with trypsin and thermolysin followed by washing with 50 mM DTT before detection of 
captured phage. Only one of these clones contain two or more cysteines. 

Proteolytic treatment of the phage library followed by a wash with DTT therefore 
allows the selection of peptide inserts which are protected from proteolysis and which 
20 are not held together through disulphide bonds. 
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Table 10. Barstar binding of phages displaying barnase-p3 fusion inserts selected 
after proteolytic treatment followed by treatment with 50 mM DTT and amino acid 
sequences of their PstI inserts in fd-3. Barstar binding (H-DTT) after proteolysis of 
phage with trypsin and thermolysin (2 ng/|il each) followed by a 50 mM DTT is 

5 determined by measurement of a signal for Barstar binding (4- DTT), which is at least 
60% of the signal for barstar binding of phage without protease treatment. Phage are 
randomly picked after two (2x) rounds of proteolytic with trypsin (Tr) and/or 
thermolysin (Th). Phages are treated with proteases, captured with biotinylated barstar 
in microtitre well plates, and washed with 50mM DTT where applicable. Bound phage 

10 is detected with a horse radish peroxidase conjugated anti-M13 phage antibody. 

Phage Proteolytic Barstarbindg Amino acid sequence 



Clone 


selection 


4- DTT 


of inserts 






B2 


-13 


2xTr/Th 


yes 


LQSSGTEVDR 
A 


GNQQHDTNDR 


DFTHTPLSS 


B2 


-14 


2xTr/Th 


yes 


LQSSGSVAQG 


SSASVDVTAX 


NAVLSADSL 










SLGGGEPA 






B2 


-22 


2xTr/Th 


yes 


LQSSGGAVAV 


TPGPVLSSA 




B2 


-23 


2xTr/Th 


yes 


LQSSGHCRGK 


PVLCTKTA 




B2 


-15 


2xTr/Th 


yes 


LQSSGVRPA 






B2 


-17 


2xTr/Th 


yes 


no insert/ 


Barnase & : 


p3 in frame 


32 


-20,21 


2xTr/Th 


yes 


no insert, 


Barnase h ; 


p3 in frame 


32 


-15,24 


2xTr/7h 


yes 


LQSSGEPAPA 


KEAKPTEAPV 


AKAEAKPEX? 










AKLSSA 






E2 


-16 


2xTr/Th 


no 


LQSSGCVRLK 


RTSVKHQPDA 


WPEPHLKAAC 










EPA 






32 


-19 


2xTr/?h 


no 


LQSSGWDWA 


KMREIADSIG 


AYL F VDMAKV 



AALSSA 
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Brief description of Figures 

Figure 1. Cleavage of phages with protease sites. Phages were prepared by rescue 
with KM13 (pHENl, A + B), or with VCSM13 (pKl, C + D). Uncieaved (A + C) or 
5 cleaved with trypsin (B 4- D). 5 ^1, 2.5 yl and 1 p\ phages were loaded as indicated. 
Molecular weight markers are in kD. 

Figure 2. The phagemid vectors pKl and pK2. These vectors contain a protease 
cleavable sequence between D2 and D3 of the phage p3 protein. In pKl, D2 4- D3 are 
10 in frame; in pK2, D3 is out of frame. 

Figure 3. Binding of phage-barnase to barstar. Phage displaying different fusion 
protein are incubated with biotinoylated barstar captured on streptavidin-coated plate and 
detected by ELISA. a) barnase mutant A, b) barnase mutant B, c) villin , d) no phage. 

15 

Figure 4. Temperature denaturation of phage fusion proteins. Phagemids were 
rescued with KM 13, infectivity (TU/ml) shown after incubation and cleavage with 
trypsin at given temperatures. Fusion with villin subdomain (triangles), barnase mutant 
A (diamonds), barnase mutant B (squares), pHENl -chloramphenicol resistant (circles). 

20 

Figure 5. The fd vector fd-3. The gene for the H102A mutant of Barnase is introduced 
by subcloning into fd-DOG [43] after PCR amplification with suitable oligonucleotides 
using the restriction sites ApaLI (at the Barnase 5' end) and NotI to create fd-3. 

25 Figure 6. Map of phagemid vector used for display of Stoffel fragment on the 
surface of phage. The asterisk shows sequences that are randomised at least partially in 
libraries I and II. 
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Claims 

1 . A method for the selection of a virus comprising the steps of: 

(a) providing a virus encoding and displaying a fusion ploypeptide, said fusion polypeptide 
comprising a heterologous polypeptide inserted into the sequence of a viral coat protein 
polypeptide, wherein said virus comprises a cleavable site located within a displayed 
polypeptide; 

(b) exposing the virus to a cleaving agent; 

(c) propagating the virus comprising intact fusion protein. 

2. A method according to claim 1 in wherein the cleavage site is located within the fusion 
polypeptide . 

3. :"I A method according to claim 2 wherein after cleavage, the virus comprising uncleaved 
fudlbn polypeptide is separated from virus comprising cleaved fusion polypeptide. 

4.I P A method according to claim 1 wherein cleavage impairs the ability of the polypeptide 
comprising the cleavage site to mediate the infection of the virus. 

J 5 Q A method according to claim 1 , wherein the virus encodes a repertoire of sequences. 

6. iwsf A method according to claim 5, wherein the repertoire of sequences encodes the 
displayed heterologous peptide or protein. * 

7. A method according to any one of claims 5 or 6 in which the cleavable site is comprised 
within the repertoire of sequences. 

) 8. A method according to claim 1, wherein a virus that is resistant to cleavage is propagated 
by infection. 

9. A method according to claim 8 in which a virus which is resistant to cleavage displays a 
folded protein or polypeptide. 
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SELECTION SYSTEM 

The present invention concerns a method for the selection of a virus comprising the steps of : (a) 
providing a virus encoding and displaying a fusion polypeptide, said fusion polypeptide 
comprising a heterologous polypeptide inserted into the sequence of a viral coat protein 
polypeptide, wherein said virus comprises a cleavable site located within a displayed 
polypeptide; (b) exposing the virus to a cleaving agent; (c) propagating the virus comprising 
intact fusion protein. 
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Figure 2. 
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Figure 4. 
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Figure 6. 
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